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ABSTRACT 


This  memorandum  gives  a  methodology  for  analysing 

attrition  data>  and  establishing  the  reliability  of  the  results. 

•  Underlying  assumptions  include  exponentially  distributed  times 

between  attritions  and  a  mean  that  increases  with  operating 

experience.  Principal  techniques  used  are  Maximum  Likelihi.  '>d 
Estimation  and  Jackknifing.  An  example  is  given,  drawn  from  F/A-  ' j 
aircraft  data. 
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INTRODUCTION 


1.  This  paper  is  en  outcome  of  studies  in  Aircraft  Attrition, 
the  bulk  of  which  were  completed  in  early  1985.  It  deals  with  some 
of  the  theoretical  aspects  of  using  a  limited  amount  of  data  to 
evaluate  the  parameters  of  a  commonly  accepted  'learning  curve' 
model,  and  to  determine  the  reliability  of  those  values.  The 
statistical  techniques  used  are  not  'state  of  the  art',  but  equally 
are  not  rudimentary.  Computer  assistance  in  the  application  of 
these  aspects  can  be  considered  1  ighly  desirable. 

2.  The  work  documents  and  demonstrates  an  approach  to 
Attrition  modelling  using  Maximum  Likelihood  Bstimatlon  (MLB)  and 
Jackknifing  with  a  worked  example  from  attrition  modelling  for  the 
P/A-18  aircraft.  Use  of  MLB  is  highly  recommended  for  situations  in 
which  Normal  distributions  are  not  applicable.  Jackknifing  is  an 
additional  technique  for  improving  the  reliability  of  parameter 
estimates  by  reducing  both  bias  and  the  effect  of  outliers,  in 
addition  it  can  allow  error  bounds  on  the  estimates  of  single 
parameters  to  be  approximated. 

Rationale 

3.  While  this  paper  has  been  written  principally  for  the 
attrition  analyst,  it  is  appropriate  to  outline  the  reasons  fur 
adopting  this  particular  approach. 

4.  Present  practice  in  some  areas  is  to  fit  (by  least  squares 
regression)  a  straight  line  to  cumulative  attrition  data. 
Unfortunately,  classical  line  fitting  processes  require  that  each 
data  point  be  substantially  Independent  of  all  others.  Cumulative 
data,  by  its  very  nature  cannot  be  independent  since  a  change  in  one 
value  will  alttr  all  those  values  that  come  after  it.  To  overcome 
this  problem  given  cumulative  data,  the  differences  between  adjacent 
observations  may  be  used  instead.  In  attrition  terms,  this  is  the 
time  between  losses. 

5.  Another  practice  in  some  areas  is  to  use  least  squares  for 
regression  or  point  estimation  to  determine  the  mean  or  trend  of 
time  between  losses.  Unfortunately  the  least  squares  approach 
requires  that  the  variation  of  the  data  about  the  point  or  trend 
line  be  generally  symmetric  and  preferably  approximately  Normally 
distributed.  Attrition  data  however  tend  to  have  exponentially 
distributed  times  between  losses.  This  is  a  phenomenon  that  has 
been  observed  in  practice,  and  which  is  supported  theoretically 
under  the  proposition  that  the  probability  of  loss  in  any  one  short 
time  interval  is  the  same  as  in  any  other,  for  periods  when  learning 
effects  can  be  neglected.  The  exponei,tlal  distribution,  as  it  stands 
or  even  when  transformed  by  taking  logs,  is  not  symmetric. 
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6.  Th*  approach  adopted  for  thla  paper  has  been  to  aeiume 
that  attrition  intervals  are  (locally)  exponentially  distributed  but 
with  a  mean  that  increases  with  flying  experience.  f?hile  an 
exponential  learning  process  has  been  used,  the  aore  general  Weibull 
distribution  has  not  been  adopted  since  it  restricts  theoretical 
learning  process  options.  With  some  additional  work  the  analyst  may 
follow  the  general  lines  of  this  paper  using  another  form  of 
learning  such  as  exponential  to  an  asymptote  etc.  A  frequently  used 
methodology  for  handling  parameter  estimation  in  non-Normal 
situations  is  that  of  Maximum  Likelihood  Bstimatlon  (MLB)  and  is 
adopted  here. 

7.  Data  from  distributions  which  have  long  tails  (e.g.  the 
exponential)  can  sometimes  have  atypical  values  (outliers)  that 
affect  the  estimate  of  the  parameter  required.  The  effect  of 
outliers  can  be  reduced  using  a  technique  called  Jackknifing.  In 
addition.  Jackknifing  produces  intermediate  values  which  are  more 
likely  to  be  approximately  Normally  distributed  that  the  original 
data.  when  this  is  the  case,  it  allows  classical  statistical 
methods  to  be  used  in  estimating  the  accuracy  with  vrhich  the  single 
parameters  have  been  found.  Other  methods  for  determining  accuracy 
are  often  difficult  to  apply.  A  further  advantage  of  Jackknifing  is 
that  it  reduces  any  bias  that  is  intrinsic  in  the  estimation 
process. 

Assumptions  for  Attrition  Modelling 

8.  Attrition  studies  generally  assume  that  over  short  periods 
each  individual  attrition  or  'event'  is  independent  of  any  other  and 
that  events  occur  with  equal  probability  in  equal  time  intervals. 
From  this  it  follows  that  the  Poisson  distribution  describes  the 
numbers  of  events  occurring  in  set  time  intervals  or,  equivalently, 
that  the  times  between  successive  events  are  Exponentially 
di'itributed  (over  short  periods). 

9.  Another  co  unon  assumption  is  that  over  long  periods  the 
attrition  rate  decreases  with  incra'*sing  operational  experience,  and 
that  this  decrease  is  exponential,  as  for  example  in  the  following 
equation : 


E  -  Ah'*  (1) 

where  E  is  the  expected  number  of  events  per  unit  of  operating  time, 
(e.g.  attritions  per  flying  hour)  and  h  is  the  cumulative  operating 
time.  A,  B  are  constants  requiring  statistical  evaluation.  This 
can  represent  a  form  of  learning  where,  with  experience,  fewer 
losses  occur.  The  learning  process  is  generally  fairly  slow, 
however,  and  may  even  depend  on  the  attritions  themselves  (i.e. 
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Equation  1  could  bo  on  opproxlaotlon  of  o  oorioo  of  stops).  Tho 
colativoly  slow  looming  is  assuaod  to  bo  such  that  infringoaont  of 
tho  roisson/Exponontial  roquiroaonts  can  bo  noglcetod.  This 
particular  approach  to  tho  probloa  also  has  considorablt  flexibility 
in  altornato  nodal  soloction  since  different  foras  of  learning 
aquation  can  bo  adopted,  using  the  sane  general  approach  (e.g. 
exponential  learning  to  an  asyaptote). 

10.  integrating  Equation  1  froa  tine  h,  to  h  gives  the 
expected  cuaulative  nunber  of  events  (C)  that  have  occurred  up  to 
tiao  h: 


A  (h^-*  -  hi-“) 

C  -  -  (2) 

1  -  B 

Since  it  is  a  cuaulative  value,  C  must  be  positive  and  continuously 
increase,  either  to  an  asyaptote  (1-B  negative)  or  indefinitely  (1-B 
positive).  In  addition  if  the  number  of  expected  events  per  unit 
time  is  decreasing  (i.e.  'learning'  is  faster  than  'burnout')  then  B 
must  be  positive,  although  negative  values  for  B  do  have  a  physical 
significance. 

11.  The  other  constant  (A)  in  Equations  1  and  2  must  also  be 
positive  since  both  cumulative  operating  time  (h)  and  number  of 
events  must  be  positive. 

12.  In  summary  then,  the  assumptions  are: 

a.  over  short  periods,  times  between  events  are  well 
approximated  by  the  exponential  distribution  and  thus 
numbers  of  events  per  unit  time  approximate  the 
Poisson  distribution; 

b.  learning  applies  in  an  exponential  manner 
(Equation  1);  and  is  a  factor  in  determining  the  mean 
of  the  exponential  distribution  at  any  given  time; 
and 

c.  constant  A  of  Equation  1  must  be  positive. 

13.  Although  the  Poisson  distribution  and  the  cumulative 
events  model  (Equation  2)  are  frequently  encountered  in  attrition 
modelling,  they  are  not  really  satisfactory  as  statistical  models 
since : 
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a.  In  this  context  tho  Bolsson  distribution  roquicct  an 
artificially  dtfinad  unit  of  tiaa  within  which  tha 
nuaibar  of  avanta  must  ba  countad;  and 

b.  in  tha  cunulativa  modal,  aaquantial  obaarvationa  ara 
not  indapandant. 

To  avoid  auch  problans,  tima  batwaan  avanta  may  ba  uaad  instead,  and 
tharafora  tha  modal  to  datarmina  tha  assential  paramatara  A  and  B 
isi 


A 

and  p{t)  -  -  (4) 

m 

whara  A  and  B  ara  constants  as  bafora, 

h  is  tha  cumulativa  oparating  axparianca, 

m  is  tha  maan  tima  batvean  evants, 

t  is  tha  tima  batwaan  avanta,  and 

p(t)  is  tha  probability  dansity  function  of  t. 


14.  Obsarvations  of  intar-evant  times  ara  ganarally  numbarad 

in  chronological  order.  This  numbering,  represented  by  '1'  is  then 
used  to  index  the  modal  variables  (m,  t  and  h).  Thus  the  time 
between  evants  is  related  to  cumulative  time  through: 

ti  -  h,  -  (?) 

where  h,  >  0  usually 

and  Equations  3  and  4  can  ba  rewritten: 
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h? 


A 


{€) 


P(tt) 


(7) 


«rh*r«  It  tht  euaulatlv*  optrttlng  txpttitnct  at  tvtnt  1, 
aj  it  tht  attn  Inttrtvtnt  tiat  at  avaitt  1,  and 
tj  la  tht  tlat  bttwttn  tvtnta  i-1  and  i. 


15.  Thut  Squationt  5  to  7  rtprtatnt  tht  ttatiatlcal  aodtl  of 
txpontntial  Itarnlng  (Equation  6)  with  an  txpontntially  diatributtd 
tiat  bttwttn  tvtnta. 

initial  Paraatttr  Eitiaation  by  WLE 

16.  It  it  coaaon  practlct  to  attest  the  parameters  of  models 
using  the  method  of  Least  Squares.  This  method  works  well  when  the 
distributions  underlying  the  data  are  Normal  or  even  when  they  are 
just  sycattrinal.  However  it  hat  been  found  that  skew  distributions 
such  as  the  txpontntial  do  not  respond  so  well,  in  this  cast  an 
alternative  methodology  can  be  evolved  using  naximum  Likelihood 
Bstlaation  (nlB)  techniques.  When  applied  to  fitting  straight  lines 
through  data  distributed  normally  across  the  line,  NLB  gives  the 
same  result  as  the  Least  Squares  method.  As  its  name  implies,  NLE 
yields  parametet.  value  estimates  which  maximise  the  likelihood  of 
occurrence  of  the  set  of  observed  values,  given  the  underlying 
distribution. 

17.  Following  the  techniques  for  NLB  outlined  in  Reference  4, 
the  method  for  this  problem  is  to  obtain  analytically  the  overall 
probability  of  all  the  n  observations  occurring.  That  is  to 
maximise 

^  "  Pi  Pj  •  •  •  Pi  •  •  •  Pn 


where  p^  arc  distributed  as  p(t^)  in  Equation  7  and  P  is  to  be 
maximised  with  respect  to  A  and  B.  It  can  be  shown  that  this  is 
equivalent  to  maximising: 
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Q  •  ln(P)  »  ^  ln(pj ) 
i 


•  2 

1 


ln<B^  ) ) 


(S) 


which  can  b«  solved  for  A  and  B  (taplicit  in  a^ )  by  using  two 
siaultansous  aquations  obtained  froa  partial  differentlationo  For  a 
naxiaua,  the  partial  derivatives  aust  be  serot 


—  -  0 

SA 

(9) 

>Q 

—  -  0 

as 


It  should  be  noted  that  these  derivatives  can  be  expanded)  so< 


and 


aQ 

aA 

ao 

aa 


^  Ua.  aA  J 

i 

V 

^  laxi  as  . 


(10) 


-  0 


Evaluation  of  the  derivatives  can  therefore  proceed  by  partial 
differentiation  of  Q  with  respect  to  ,  and  with  respect  to  A 
and  B.  Differentiation  of  Equation  8  with  respect  to  a^  yields: 


(11) 
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Expanding  at  in  Equation  8  allows  thaaa  aquations  to  ba  rawrittan 

as  I 


(15) 


2  ln(hi ) 


Evaluation  of  A  and  B  procaads  by  salactlon  of  a  valua  for  B,  which 
is  usad  to  avaluata  A  through  aaeh  of  Equations  15  and  16.  Tha 
diffaranea  batwaan  thaaa  two  A  valuas  is  than  usad  tc  pradict  a  naw 
Bt  aiming  for  saro  diffaranea  in  tha  A  valuaa.  Tha  proctss  is  than 
rapaatad  until  rquations  15  and  16  giva  tha  sama  rasults. 

18.  In  tha  avant  that  B  is  dafinad  by  soma  othar  aathod  or 
process,  (for  axaapla  through  data  from  aircraft  of  tha  sama  generic 
type)  Equations  9  to  14  still  apply,  thus  allowing  Equation  IS  to  ba 
usad  to  avaluata  A  directly. 


Improving  and  Assessing  Parameter  Estimates  by  Jackknitins 


19.  The  preceding  saction  has  shown  how  to  astimata  tha 
paramatars  A  and  B,  given  a  sat  of  data.  The  process  of 
Jackknifing,  in  general  terns,  re-assassas  such  paramatecs,  using 
only  a  reduced  data  set,  but  for  all  possible  reduced  data  sets. 
The  name  of  the  technique  is  said  to  have  arisen  from  the  fact  that 
the  blade  of  a  Jackknife  folds  back  into  itself,  just  as  the  set  of 
original  obsarvations  are  folded  back  into  themselves.  For  example, 
a  data  set,  of  say  n  observed  events,  reduced  by  one  observation, 
will  give  n  reduced  subsets  of  sisn  n-1  observations.  These  subset 
astinatas  are  then  suitably  combined  with  the  original  estimate 
(from  tha  whole  data  sat)  to  give  an  improved  paranatar  estimate. 
Also  of  importance  however,  is  the  fact  that  tha  distribution  of  tha 
subset  astinatas  is  often  such  that  Student's  t  distribution  can  ba 
applied  to  giva  a  first  approximation  to  tha  confidence  limits  on 
tha  improved  estimate.  It  should  ba  noted  that  there  is  no 
theoretical  assurance  of  tha  applicability  of  Student's  t  in  all 
circumstances,  but  reasonable  results  appear  likely  in  many 
practical  situations.  Maximum  likehood  estimati'tn  is  thought  to  be 
particularly  suitable  for  use  with  Jackknifing.  The  approach  used 
here  is  based  on  that  of  Reference  1. 
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20.  A  Cvw  points  nood  to  bo  aontionod  about  Jackknifing  and 
oxtroao  points.  Tho  gross  impact  of  tho  Jackknifo  in  roducing  tho 
offset  of  occasional  oxtroao  points  on  tho  ostiaato  should  not  bo 
undorratod.  For  oxaaplo,  givon  20  data  points  tho  chanco  that  all 
of  than  will  lio  inaido  tho  95  por  cont  band  is  only  about  1/3, 
(i.o.  0.95**),  and  thoso  outaido  it  aay  abnormally  bias  tho  aiaplo 
ostiaato.  Noto,  howovor,  that  ostiaatos  which  utilise  points  from 
tho  oxtroaos  for  their  result,  or  which  take  no  values  froa  such 
oxtroaos  (o.g.  order  statistics),  aay  not  respond  well  to 
Jackknifing.  Conversely,  ostiaatos  such  as  moans,  variancss  etc., 
for  skew  distributions  with  long  tails  should  to  respond  bettor  to 
tho  technique.  This  latter  case  is  tho  region  into  which  attrition 
modelling  fits. 

21.  Briefly,  tho  definitions  and  equations  required  to 
Jackknifo  by  dropping  single  observations  are  detailed  below.  A 
siaplified  and  modified  fora  of  the  notation  of  Reference  1  is  used 
in  order  to  clarify  the  presentation: 

a.  A  paraaeter  e  is  required  (e.g.  mean,  standard 
deviation,  constant  in  a  regression  line,  etc). 

b.  There  is  a  data  set  of  observations,  each  complete 

observation  identified  by  (i  >  1  to  n)  and  the 

whole  set  identified  by  X.  Subsets  are  identified  by 
X. 4  which  represents  X  less  observation  . 

c.  t,  is  the  estimator  of  B  from  X. 

d.  t.j  the  estimator  of  6  froa 

e.  The  Jackknifed  estimate  of  9  is  t* : 

n  -  1  ^ 

t‘  -  nt, - )  t.i  (17) 

n  ^ 
i 

If  confidence  intervals  are  required  then  additional  definitions  are 
needed: 
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f.  So  called  'pseudo-values': 

t!i  -  nt,  -  (n  -  Dt.i  (18) 

wMch  relate  to  the  Jackknifed  estimate  of  6  as 

i 

t*  .  - 

n 

g.  The  variance  of  t*  is  var(t*j: 

)*  -  n(t*  )* 

var(t  )  ■  - 

(n  -  1) 


which  is  the  conventional  definition  of  a  sample 
variance  for  observations  tl^.  This  may  be  simplified 
to: 


'  n 

2 

I 

1 

1-1 

.  n 

,  n  , 

(20) 


h.  Application  of  the  Student's  t  distribution  is  subject 
to  the  distribution  of  t_^  being  approximately  Normal, 
using  n-1  degrees  of  freedom  for  the  n  values  of  t_^ 
and  is  to 

(t*  -  0)-ln 

- - 7—  (21) 

'lvar(t  ) 


In  practice  the  pseudo  values  are  simply  scaled  and  translated  t_^ 
(Equation  18)  so  Normality  in  the  t_^  is  all  that  needs  to  be 
approximated . 


22.  Should  it  be  necessary  to  estimate  confidence  intervals  on 
more  than  one  parameter,  likelihood  ratio  tests  may  be  considered. 
If  it  is  felt  that  the  pseudo-values  are  drawn  from  a  multivariate 
Normal  distribution,  the  likelihood  ratio  tests  lead  to  the  test 
as  an  analogue  of  the  t-test.  Details  may  be  found  in  Reference  6. 


i 

E 

i 
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23.  There  are  some  other  very  broad  conditions  whic.  should  be 
met  for  the  technique  to  be  applicable.  Other  than  a 
dlfferantl».ijility  condition,  and  to  say  that  most  continuous 
parametric  statistical  situations  meet  the  conditions,  their 
discussion  is  beyond  the  scope  of  this  paper  (for  more  details  see 
References  1  and  2).  The  differentiability  condition  is  that  t 
estimates  a  function  that  after  differentiation  is  continuous. 
Amongst  other  things,  this  implies  that  the  estimator  be 
unconstrained,  and  therefore  the  constant  A  (from  the  learning 
model)  cannot  be  Jackknifed.  Constant  B,  while  conceptually 
constrained  can  in  fact  be  estimated  at  a  value  outside  its 
conceptual  constraints  through  an  abnormal  (but  possible)  data  set. 
To  solve  the  problem  for  A,  the  log  of  A  can  be  used,  giving 
multiplicative  rather  than  additive  confidence  intervals.  Solutions 
for  the  B  problem  will  be  specific  to  the  situation,  the  data  set 
and  the  certainty  on  the  bounds  of  B.  In  many  cases  it  may  be 
acceptable  that  'unlearning'  can  occur,  especially  in  the  case  of 
ageing  equipment,  new  operating  practices  or  conditions,  and  new 
operators.  The  concept  of  an  upper  bound  on  the  total  number  of 
events  may  also  be  acceptable,  particularly  if  only  a  limited  range 
of  cumulative  operating  experience  is  to  be  modelled.  Both 
logarithms  and  fractional  (e.g.  x/(a  +  bx))  transforms  can  be 
considered  if  necessary,  but  care  should  be  taken  as  they  will 
affect  the  confidence  limits  considerably. 

Example 

24.  Attrition  data  for  the  F/A-18  on  a  world  wide  basis, 
excluding  losses  that  are  not  flying  related  has  been  used.  As  much 
of  this  work  was  done  in  early  1985  when  only  five  losses  had  been 
sustained,  the  data  have  been  portioned  at  that  point.  Table  1 
gives  data  for  the  first  five  losses  and  Table  2  for  the  succeeding 
22  losses.  The  first  set  of  data  was  used  to  predict  the  loss  rate 
for  the  midpoint  of  the  period  covering  the  last  10  losses,  and 
compared  with  the  average  loss  rate  for  those  losses.  Parameters 
assessed  are  A  and  B  in  the  equation 


where  m  is  the  mean  Interevent  time,  and  h  is  the  cumulative  flying 
experience  (hours)  (Equation  3).  Studies  of  similar  us  aircraft 
indicate  that  a  good  a  priori  value  for  B  is  0.247.  Comparisons  of 
results  using  Least  Squares,  MLE  and  Jackknifed  MLE  have  been  given, 
as  well  as  the  effect  of  the  a  priori  Information. 
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Table  1.  FIRST  FIVE  F-18  LOSSES 


LOSS 

(i) 

CUMULATIVE  OPERATING 
EXPERIENCE  (h^ ) 

INTER  EVENT  TIME 
( t^  -  ~  h^  _  ^  ) 

0 

0 

- 

1 

3337 

3337 

2 

4379 

1042 

3 

27300 

22921 

4 

4580S 

18505 

5 

64717 

18912 

Table  2.  F-18  LOSSES  SIX  ONWARD 


LOSS 

(i) 

CUMULATIVE 

OPERATING 

EXPERIENCE 

(hj  ) 

INTER  EVENT  TIME 

(ti  -  hi  -  hi.i ) 

LOSS 

(i) 

CUMULATIVE 

OPERATING 

EXPERIENCE 

(hi  ) 

INTER  EVENT  TIME 

(ti  -  hi  -  hi.i) 

6 

75430 

10713 

17 

3b3676 

10133 

7 

124248 

48818 

18 

397609 

7933 

8 

143495 

19247 

19 

404867 

7258 

9 

154720 

1122S 

20 

456060 

51193 

10 

198658 

43938 

21 

462696 

6636 

11 

248657 

49999 

22 

472105 

9409 

12 

304747 

56090 

23 

490221 

18116 

13 

347378 

42631 

24 

490978 

757 

14 

349723 

2345 

25 

498631 

7653 

15 

379543 

29820 

26 

519600 

20969 

16 

379543 

0 

27 

519600 

0 

Equation  6  was  suitably  modified  (by  taking  logs  of  A  and  and 
estimating  log  m^  by  log  t^ )  then  Least  Squares  regression  was 
applied,  yielding: 


A  -  0.734 


B  -  0.891 
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Application  of  MLB  ylaldad: 

A  «  0.356  B  -  0.831 

And  using  the  Jackknife  with  the  HLE  estimates  gave: 

A  -  0.0571  B  -  0.655 

Should  a  value  of  B  be  selected  for  a  priori  reasons  then  only  one 
parameter  needs  to  be  estimated,  allowing  an  estimate  of  the 
confidence  interval  for  that  parameter  when  Jackknifed.  In  this 
Instance  previous  studies  of  United  States  Air  Force  (JSAF)  aircraft 
have  shown  that  over  hundreds  of  attritions  for  both  single  and  twin 
engine  F/A  type  aircraft  the  best  estimate  for  B  is  0.247.  Using 
this  value  gives  the  following  results: 

A  by  least  squares  *  0.00142 

A  by  HLE  *  0.00101 

A  by  MLE  Jackknifed  >  0.00096 

and  the  90  per  cent  interval  (with  4®  of  freedom,  t  <•  2.1) 
is:  0.0047  >  A  >  0.00036 


)5.  Repeating  this  work  for  the  whole  data  set  (Tables  1  and 

2)  produced  the  results  shown  in  Table  3. 


Table  3.  RESULTS  FOR  ALL  F-18  LOSSES 
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a.  Coaparlng  thasa  flguras  with  tha  aarliar  valuas  baaad  on 

Tabla  1  it  can  ba  aaan  that  tha  laast  variation  in  raaults  occurs  in 
tha  Jackknifad  hle  results,  axcapt  for  tha  one  paraaatar  case  where 
tha  diffarancas  in  tha  mle  case  and  the  Jackknifed  HLE  are  too  small 
to  ba  significant.  Further,  if  tha  a  priori  value  for  B  really  is 
tha  'true'  value  for  B,  then  the  Jackknifed  results  were  closer  to 
that  value  with  both  data  sets. 

27.  Figure  1  shows  tha  attrition  data  (Tables  1  and  2)  as 
tines  between  losses,  and  the  two  parameter  lines  ganarati^  *  by  the 
three  different  methods  using  the  Table  1  data.  Although  none  of 
the  lines  produces  an  excellent  result  for  the  Table  2  data,  it  is 
not  difficult  to  see  from  this  that  the  Jackknifed  xine  is  the  best 
predictor.  Using  the  a  priori  information  with  Table  1  data 
improves  the  prediction  as  shown  in  Figure  2.  It  is  difficult  to 
see  which  line  is  best,  based  on  the  Table  1  data  alone,  but  when 
all  data  are  included  with  these  lines,  it  is  clear  that  the 
Jackknifed  line  is  best  for  the  following  reasons: 

a.  the  MLE  and  Jackknifed  lines  from  Table  3.  b  -  0.247 
(based  on  27  losses)  would  be  indistingu  table  from 
line  J  Figure  2  (based  on  five  losses)  .  drawn  on 
Figure  2;  and 

b.  bearing  in  mind  the  shape  of  the  exponential 
distribution  and  the  fact  that  its  median  Is  closer  to 
sero  than  its  mean,  then  the  Least  Squares  line  is 
probably  the  poorer  predictor  because  it  has  13  points 
above  the  line  and  14  below. 

It  should  be  noted  that  although  the  differences  between  the  lines 
look  small,  the  figures  are  drawn  on  log  paper.  Mean  times  between 
events  as  estimated  by  lines  N  and  J  are  nearly  half  as  much  again 
as  those  estimated  by  line  L  in  Figure  2.  Table  3  indicates  almost 
a  fourfold  decrease  in  the  mean  times  estimate  for  the  Least  Squares 
result  over  the  other  two  methods.  These  are  factors  which  can  have 
a  major  impact  when  considering  attrition  buys.  A  further  advantage 
from  using  the  Jackknifing  method  is  the  availability  of  confidence 
intervals  on  the  estimates  which  have  been  illustrated  in  this 

example,  by  the  90  per  cent  confidence  interval  (Figure  2). 

28.  The  cumulative  losses  prediction  is  also  of  interest  and 

is  plotted  for  the  one  parameter  Jackknifed  HLE  method,  in  Figure  3 
along  with  the  data.  It  should  be  noted  that  the  confidence 

interval  relates  to  the  line  and  not  the  data,  and  that  a 

particularly  'good'  or  'bad'  run  of  losses  early  in  the  loss  history 

will  shift  all  the  following  losses  in  relation  to  the  line  and 
intervals  -  i.e.  the  data  points  are  not  independent  in  this  form. 


IS 
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29.  Xf  confldtncs  intervals  act  iaportant  it  aay  bt  nactssary 
to  check  that  a  Noraal  approxiaation  to  the  psuedo  values  is 
appropriate.  In  practice  it  aay  be  found  that  for  attrition 
aodelling  the  interval  overestiaates  a.  little  on  one  side  and 
underestiaates  a  little  on  the  other.  This  aay  be  due  to  the  long 
tail  of  the  exponential  distribution.  Since  a  conventional  variance 
definition  is  used  with  t*^  it  would  seea  reasonable  to  use 
non-paraaetric  statistics  as  an  alternative  for  interval  estiaation. 
Reference  5  gives  soae  such  aethods. 

30.  A  quick  check  on  the  appropriateness  of  an  assuaptlon  of 
Noraality  can  be  done  by  sorting  the  data  (t_^)  and  plotting  the 
result  at  equal  intervals  on  Noraal  Probability  paper.  Figure  4 
Illustrates  this  for  Table  1.  An  adequate  straight  line  fit  to  the 
data  indicates  Noraality.  The  values  for  all  t.^  calculated  for 
Table  1  data  are  shown  in  Table  4.  Included  in  the  table  are 
coluans  j  which  give  the  sorted  order  for  the  preceding  column. 


Table  4.  INTERMEDIATE  JACKKNIFE  RESULTS 


INDEX 

(i) 

ln(A)  RESULTS 

B  RESULTS 

ln(A)  RESULTS 
FOR  B  »  0.247 

COMMENTS 

j 

t-t 

j 

t-i 

j. 

0 

-1.03 

- 

0.83 

- 

-6.90 

- 

Pure  MLE  Result 
-  i.e.  t_, 
estimates  ln(A) 
and  B 

1 

2.66 

5 

1.18 

5 

-7.02 

2 

2 

-3.05 

1 

0.64 

1 

-7.09 

1 

3 

-1.52 

2 

0.76 

2 

-6.66 

5 

4 

-0.81 

3 

0.86 

3 

-6.82 

4 

5 

-0.15 

4 

0.93 

4 

-6.84 

3 

Noto. 

1.  Columns  j  give  the  sorted  t.^  order. 
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CONCLUSION  AND  SUHKAHY 


31.  This  paptr  has  brlafly  indlcatad  a  aathodology  that  may  ba 

followad  in  analysing  attrition  data,  that  is  prafarabla  to 
classical  laast  squaras  typa  approachas.  It  has  also  givan  tha 
rationala  and  undarlying  assumptions  bahind  tha  work  and 
damonstratad  (through  an  axampla  on  F/A-18  aircraft  attritions)  tha 
sorts  of  diffarancas  in  rasults  that  might  ba  axpactad. 

32 r  Tha  usa  of  NIB  mathods  ara  strongly  racommandad  in  this 

>  araa,  and  tha  usa  of  thx  Jackknifing  is  also  commandad  whathar  or 
not  HLB  is  usad.  A  program  has  baan  wrlttan  in  PASCAL  for 
attritions  (using  MLB  and  Jackknifing)  and  is  availabla  through 
Rafaranca  3. 
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